Combinatorial and convex optimization for probabilistic models in computer vision
نویسنده
چکیده
This thesis investigates the role of optimization in two areas of Computer Science: Computer Vision and Machine Learning. Specifically, we consider two well-known problems in Computer Vision, namely motion segmentation and object category specific image segmentation, and a fundamental problem in Machine Learning, known as maximum a posteriori (map) estimation of discrete probabilistic models. In order to address the problem of motion segmentation, we propose a novel probabilistic model which is suitable for this application. Our model includes the effects of occlusion, lighting changes and motion blur. The segmentation of a given video is obtained by performing efficient inference on the probabilistic model in two stages: (i) In the first stage, an initial estimate of the model is obtained using a novel coarse-to-fine technique which reduces the time and memory required by the sum-product belief propagation algorithm; and (ii) In the second stage, the initial estimate is refined using the observation that the energy of the model can be efficiently reduced using the αβ-swap and α-expansion algorithms. For object category specific image segmentation, we extend the probabilistic model used in previous approaches. Specifically, we incorporate it with an object category model which provides top-down information about the shape of the object. Given an image, its segmentation is determined using two new algorithmic contributions: (i) We propose efficient methods for obtaining samples of the object category models of our choice by matching them to the given image; and (ii) We make the (not obvious) observation that these samples can be quickly marginalized within the em framework using one st-mincut operation. We compare our method with the state of the art approaches and demonstrate significant improvement. Next, we present a theoretical analysis of previously proposed algorithms for map estimation which are based on convex relaxations. In particular, we show that a widely used linear programming (lp) relaxation strictly dominates (i.e. provides a better approximation than) some recently proposed Quadratic Programming (qp) and Second Order Cone Programming (socp) relaxations. We generalize this result to show that, despite the flexibility in the form of objective function and constraints offered by qp and socp, the lp relaxation dominates a large class of qp and socp relaxations. As a consequence of our analysis, we obtain two new socp relaxations which dominate the previous approaches. Finally, we consider the problem of efficiently solving the new socp relaxations. To this end, we build upon the tree reweighted message passing framework. We propose convergent algorithms which iteratively optimize the Lagrangian dual of the socp relaxations. These algorithms allow us to empirically verify our theoretical analysis using both synthetic and real data.
منابع مشابه
Global MAP-Optimality by Shrinking the Combinatorial Search Area with Convex Relaxation
We consider energy minimization for undirected graphical models, also known as the MAP-inference problem for Markov random fields. Although combinatorial methods, which return a provably optimal integral solution of the problem, made a significant progress in the past decade, they are still typically unable to cope with large-scale datasets. On the other hand, large scale datasets are often def...
متن کاملEvaluation of Convex Optimization Techniques for the Weighted Graph-Matching Problem in Computer Vision
We present a novel approach to the weighted graph-matching problem in computer vision, based on a convex relaxation of the underlying combinatorial optimization problem. The approach always computes a lower bound of the objective function, which is a favorable property in the context of exact search algorithms. Furthermore, no tuning parameters have to be selected by the user, due to the convex...
متن کاملExploiting Structure for Tractable Nonconvex Optimization
MAP inference in continuous probabilistic models has largely been restricted to convex density functions in order to guarantee tractability of the underlying model, since high-dimensional nonconvex optimization problems contain a combinatorial number of local minima, making them extremely challenging for convex optimization techniques. This choice has resulted in significant computational advan...
متن کاملSome Results about the Contractions and the Pendant Pairs of a Submodular System
Submodularity is an important property of set functions with deep theoretical results and various applications. Submodular systems appear in many applicable area, for example machine learning, economics, computer vision, social science, game theory and combinatorial optimization. Nowadays submodular functions optimization has been attracted by many researchers. Pendant pairs of a symmetric...
متن کاملLearning Structured Prediction Models: a Large Margin Approach a Dissertation Submitted to the Department of Computer Science and the Committee on Graduate Studies of Stanford University in Partial Fulfillment of the Requirements for the Degree of Doctor of Philosophy
Most questions require more than just true-false or multiple-choice answers. Yet supervised learning, like standardized testing, has placed the heaviest emphasis on complex questions with simple answers. The acquired expertise must now be used to address tasks that demand answers as complex as the questions. Such complex answers may consist of multiple interrelated decisions that must be weighe...
متن کاملA Hybrid Optimization Algorithm for Learning Deep Models
Deep learning is one of the subsets of machine learning that is widely used in Artificial Intelligence (AI) field such as natural language processing and machine vision. The learning algorithms require optimization in multiple aspects. Generally, model-based inferences need to solve an optimized problem. In deep learning, the most important problem that can be solved by optimization is neural n...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2008